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Abstract: Multi-class classification methods based on both labeled and unlabeled 
functional data sets are discussed. We present a semi-supervised logistic model for 
classification in the context of functional data analysis. Unknown parameters in our 
proposed model are estimated by regularization with the help of EM algorithm. A 
crucial point in the modeling procedure is the choice of a regularization parameter 

cn ■ 

■ involved in the semi-supervised functional logistic model. In order to select the 

a^ 
cn 

(N 

' to examine the effectiveness of our proposed modeling strategy. 



adjusted parameter, we introduce model selection criteria from information-theoretic 
and Bayesian viewpoints. Monte Carlo simulations and a real data analysis are given 
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1 Introduction 

In recent years, functional data analysis has been used in various fields of study such 
as chemometrics and meteorology (e.g., we refer to Ramsay and Silverman, 2002; 2005, 
Ferraty and Vieu, 2006). The basic idea behind functional data analysis is to express a 
discrete data set as a smooth function data set, and then exploit information obtained from 
the set of functional data using the functional analogs of classical multivariate statistical 
tools. Till this day, several researchers have studied a variety of functional versions of 
traditional supervised and unsupervised statistical methods; e.g., functional regression 
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analysis (James and Silverman, 2005; Yao et ai, 2005; Araki et ai, 2009a), functional 
discriminant analysis (Ferraty and Vieu, 2003; Rossi and Villa, 2006; Araki et ai, 2009b), 
functional principal component analysis (Rice and Silverman, 1991; Siverman, 1996; Yao 
and Lee, 2006) and functional clustering (Abraham et ai, 2003; Rossi et ai, 2004; Chiou 
and Li, 2007). 

Meanwhile, a semi-supervised learning, which is a modeling procedure based on both 
labeled and unlabeled data, has received considerable attention in the contemporary 
statistics, machine learning and computer science (see, e.g., Chapelle et ai, 2006; Liang 
et ai, 2007; Zhu, 2008). In particular, it is known that the semi-supervised learning is 
useful in the application areas including text mining and bioinformatics, in which ob- 
taining labeled data is difficult while unlabeled data can be easily obtained. Many of 
ordinary statistical multivariate analyses have been extended into the semi-supervised re- 
semblances by earlier researchers; e.g., semi-supervised regression analysis (Verbeek and 
Vlassis, 2006; Lafferty and Wasserman, 2007; Ng et ai, 2007), semi-supervised discrimi- 
nant analysis (Miller and Uyer, 1997; Yu et al, 2004; Zhou et al, 2004; Dean et ai, 2006; 
Kawano and Konishi, 2011) and semi-supervised clustering (Basu et al, 2004; Zhong, 
2006; Kulis et ai, 2009). 

In this paper, our aim is to extend the supervised modeling procedures for func- 
tional data into semi-supervised counterparts. We, in particular, focus on a multi-class 
classification or discriminant problem, and develop a semi-supervised logistic model for 
functional classification problem. Unknown parameters in the model are estimated by the 
regularization method along with the technique of EM algorithm. A crucial issue for the 
modeling procedure is to choose a value of a regularization parameter involved in the semi- 
supervised functional logistic model. In order to select the optimal value of the regulariza- 
tion parameter, we then introduce model selection criteria based on information-theoretic 
and Bayesian approaches that evaluate semi-supervised functional logistic models esti- 
mated by the regularization method. Some numerical examples including a microarray 
data analysis are illustrated to investigate the effectiveness of our modeling strategy. 

This paper is organized as follows. In Section 2, we consider a functionalization method 



2 



that converts the discrete data into the functional form using basis expansions. Section 
3 proposes a functional logistic model in the context of the semi-supervised multi-class 
classification problem. In this section, we also present an estimation procedure based 
on the regularization method with the help of EM algorithm. Section 4 derives model 
selection criteria to select a regularization parameter in the functional logistic models. 
In Section 5, Monte Carlo simulations and a real data analysis are given to assess the 
performances of the proposed semi-supervised functional logistic discrimination. Some 
concluding remarks are given in Section 6. 

2 Functionalization 

Suppose that we have n independent observations aji, . . . , where consist of the 
observed values x^i, . . . , XaN^ discrete times t^i, . . . , taNa-, respectively. Our aim in this 
section is to express a data set {{xai, tai); i = 1, . . . , Na, tm G T C M} (a = 1, . . . , n) as 
a set of smooth functions {^^(t); a = 1, . . . , n, t G T} by a smoothing technique. In this 
section we drop the notation on the subject Xa, and hence consider a functionalization 
procedure of the data set {(xj, tj); i = 1, . . . , A^}. 

It is assumed that the observed values {(xj, ti); i = 1, . . . , N} for a subject are drawn 
from a regression model as follows: 



where u{t) is a smooth function to be estimated and the errors Ei are independently, 
normally distributed with mean zero and variance a^. We also assume that the function 
u{t) can be represented by a linear combination of pre-prepared basis functions in the 
form 



k=l 

where Uk are coefficient parameters, m is the number of basis functions and (pkit; fik,Vk) 
are Gaussian basis functions given by 





m 




(2) 




= 1, . . . , m. 



(3) 
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Here /i^ are the centers of the basis functions and rj^ are the dispersion parameters. In 
particular, we use Gaussian basis functions proposed by Kawano and Konishi (2007), 
and hence the centers fik and the dispersion parameters rj^ are determined as follows: 
for equally spaced knots so that ti < ■ ■ ■ < T4 = min(t) < ■ ■ ■ < r^+i = max(t) < 
■ ■ ■ < Tm+i, we set the centers and the dispersion parameters as fik = Tk+2 and fj = fjk = 
ijk+2 — '^kj/'i for k = 1, . . . ,m, respectively. For details of the procedure, we refer to 
Kawano and Konishi (2007). 

It follows that the nonlinear regression model based on the Gaussian basis functions 
can be written as 



f{xi\ti;u:,a ) = — ===exp 
V 27r CT"^ 



2a2 



2" 

l,...,iV, (4) 



where uj = {ui, . . . , Um)'^ and <p(t) = (0i(t), . . . , 4>mit))^. The parameters uj and cr^ are 
estimated by maximizing the regularized log-likelihood function in the form 

N 

' log- f(xAty. u:. rr^) — - ^ 



^^{iv,a') = ^log/(x,|t,;a;,a2) - -^iv^JCiv 



i=l 



-— log(27ra2) -^ix- ^ujfix - ^uj) - — cj^/ 



where x = (xi, . . . , xnY , $ = (0(^i), • • • , <PitN))'^ , C (> 0) is a smoothing parameter and 
/C is a positive semi-definite matrix defined by /C = D2D2, where D2 is a second-order 
difference term. The regularized maximum likelihood estimates are given by 

N 

N 



1 ^ 

u; = ($^<|. + NCa'lCr'<^^x, = ^ E (^^ " ■ (6) 



1=1 



We obtain the optimal number of basis functions m and the value of the smoothing 
parameter ( by using a model selection criterion GIG (Ando et ai, 2008) for each smooth 
curve as the minimizer of the form 

GIC(C) = iVlog(27r(T2) + N + 2tr{gi?-^}, (7) 

where o"^ is given in Equation ([6]) and the m x m matrices Q and R are, respectively. 
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Figure 1: Functionalization by Gaussian basis expansions 



given by 



/ 1 
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(9) 



where l^r = (1, • • • , 1)"^ and A = diag [xi — u)'^(p{ti), . . . ,xn — i^'^4>{tN 

Hence, the observed discrete data {(x^j, tai); t^i G T, « = 1, • • • , No] (a = 1, . . . , n) 
are smoothed by the methodology described above, and we obtain a functional data set 
a = 1, . . . , n} given by 



u{t) = = Xa{t), t e T. 



(10) 



k=l 



Figure [T] shows a sketch of the functionalization using Gaussian basis functions. Circles 
represent observed discrete data, the below solid curves basis functions pre-prepared and 
the above solid line the estimated smooth curve. For details of the functionalization step 
in functional data analysis, we refer to Ramsay and Silverman (2005) or Araki et al. 
(2009a). 



3 Semi-supervised functional logistic discrimination 
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3.1 Semi-supervised logistic model for functional data 

In the framework of semi-supervised functional data analysis, we are given ni labeled 
functional data {{xa(t), ga)',<y = l,...,ni, t G T} and (n — ni) unlabeled functional 
data {xa{t) \ a = rii + 1, . . . , n, t E T}. Here Xa{t) are functional predictors given in the 
previous section and G {1, . . . , L} are group indicator variables in which g = k implies 
that the functional predictor Xa{t) belongs to group k. First, a functional logistic model 
is constructed by using only labeled functional data {{xa{t), go); a = 1, . . . , ni, t G T}. 

We consider the posterior probabilities for group k {k = 1, . . . , L) given in a functional 
data Xa{t) as follows: PT{ga = k\xa)- Under these posterior probabilities, Araki et al. 
(2009b) introduced a functional logistic model in the form 

log I l^l^^flj^ }=/?./ + I x^{t)P,{t)dt, k = l,...,L-l. (11) 

By using the same Gaussian basis function (pj(t) as in Equation ([2]), (3kit) is assumed to 
be expanded as 

m 

/3k{t) = Y,PkMt)- (12) 
i=i 

Then we can rewrite the functional logistic model in Equation ( ITTi) using the expansion 
in Equation (fT2|) as follows: 

log jl^l^^flgll =f3kf + l x^imm = f3lz^, (13) 

where /3k = {Pkf, Pki, ■ ■ ■ , PkmY = (1, w'^J)^ . Here J is an m x m matrix with 

the (2, j)-th element 

J., = V^expj-^^^^^^j, z,j = l,...,m, (14) 

where /tj and fj are estimated centers and width parameters included in Gaussian basis 
functions in Section 2, respectively. 
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Thus the conditional probabilities can be rewritten as 

exp{/3^z„} 



L-1 ■ 

l + ^exp{/3j2,} 
1 

L^^l ' 



(15) 



We describe Pi {ga = k\xa) as 7rfc(xQ,;/3), since the probabilities depend on a parameter 

vector /3 = (/3f,...,/3Lir- 

We introduce an {L — l)-dimensional response variable ya = {yi"\ ■ ■ ■ ^Ui'li)^ (« = 
1, . . . ,ni), which indicates that the k-th element of ya is set to 1 if the corresponding 
Xa(t) belongs to the k-th class, for rii labeled functional data {{xa(t), go)] a = 1, . . . , rii}. 
Hence we obtain a multinomial distribution with the posterior probabilities 7ik{xa] (3) as 
follows: 

L-1 

lr^\ . , — . r.-_1 (ry^ 

(16) 



fiya\xa;f3) = ]Jnkixa;f3y'i^^\7rL{xa-,(3)Y-^f-^"yf. 

k=l 



By introducing a dummy class label variable ta for unlabeled functional data {xa{t)] a 
77,1 + 1, . . . , n} given by 



(a) 



' • • • ' "L-l 



(k) 

(0, . . . , 0, 1 , 0, . . . , O)-'" if Xa{t) belongs to k-th class, 
(0, . . . , 0)"^ if Xa(t) belongs to L-th class, 

it is assumed that ta is distributed as the same multinomial distribution with the posterior 
probabilities TTki^a] f3) as in Equation (fT6l) . Also, for unlabeled functional data, we assume 
f3kf + J Xa(t)/3k(t) = 0lza (a = 77-1 + 1, . . . , n; A; = 1, . . . , L — 1) similar to Equation ( |T3ll . 
The log-likelihood function based on both labeled and unlabeled functional data is then 
obtained by 



L-l 



L-1 



- log ( 1 + 5^ exp{A^;^ J 



fc=i 



1=1 



E 



a=n\+l \_k=l 



L-l 



L-1 



E t'^:^0lZa - log ( 1 + E eMPjZa] 



1=1 



(17) 



3.2 Estimation via regularization 

As mentioned in Araki et al. (2009b), the maximum likelihood method often causes some 
ill-posed problems for a functional logistic model; i.e., unstable or infinite parameter esti- 
mates. Then we employ a regularization method to obtain the estimator of the parameters 
included in the functional logistic model. A regularization method achieves to maximize 
a regularized log-likelihood function 

£A(/3)=^(/3)-^$^/3rm, (18) 
k=i 

where A (> 0) is a regularization parameter and K is an (m + 1) x (m + 1) matrix given 
by 

/ 0^ \ 

K={ . (19) 

y K* J 

Here is an m-dimensional zero vector and K* is an m x m positive semi-definite matrix. 
In the section of numerical examples, we use an identity matrix as the matrix K*. 

In maximizing the regularized log-likelihood function in Equation ( !T8|) . it is difficult 
to obtain the estimator of the parameters, since the values of dummy class labels t are 
unknown and dix{f3)/df3 = does not have an explicit solution with respect to the 
parameter vector /3. Hence, we employ a following EM-based algorithm to obtain the 
estimator f3. 

Stepl Initializing the parameter vector /3 by maximizing the regularized log-likelihood 
function via only labeled functional data {{xa{t), Qa)] a = 1, . . . , rii} with the help 
of Fisher's scoring method. 

Step2 Construct a classification rule nk{xa]^)- 

Step3 By the use of the classification rule in Step2, compute the posterior probabilities 
7rfc(xQ,; (3) {k = 1, . . . , L) for unlabeled functional data Xa{t) (a = ni + 1, . . . , n). 
According to the posterior probabilities, estimate ta as follows: 

4 = (tt\ . . . , t£l)^ = M^a, $),..., TlL-lix^; (20) 
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Step4 Replace tj^ into ij^ in the regularized log-likelihood function. Then estimate the 
parameter vector /3 using Fisher's scoring method. 

Step5 Repeat the Step2 to the Step4 until the convergence condition 

|4(/3(fe+i))_£,(/3W)|<10-5 (21) 

is satisfied, where is the value of /3 after the A;-th EM iteration. 

Therefore, we derive a statistical model f{y\x; (3) which is constructed by using both 
labeled and unlabeled functional data. The statistical model includes a tuning parameter; 
i.e., the regularization parameter A. Since the selection of this parameter is regarded as 
the selection of candidate models, we introduce model selection criteria to choose the 
constructed models. 

4 Model selection criteria 

In this section, we derive two types of model selection criteria to evaluate semi-supervised 
functional logistic models from the viewpoints of information-theoretic and Bayesian ap- 
proaches. 

4.1 Generalized information criterion 

Akaike (1974) proposed the Akaike information criterion (AIC), which enables us to eval- 
uate statistical models estimated by the maximum likelihood method. While the AIC 
is very useful for various fields of research, the criterion cannot be directly applied into 
models constructed by other estimation procedures. 

Konishi and Kitagawa (1996) introduced an information criterion, which can evalu- 
ate models constructed by various estimation procedures including robust, Bayesian and 
regularization methods. Using the result of Konishi and Kitagawa (1996), we propose a 
generalized information criterion (GIC) in the context of the semi-supervised functional 
logistic model. The model selection criterion is given as follows: 

ni 

GIC = -2^1og/(y„|x«;/3) + 2tr |g(/3)i?-i(/3)} , (22) 

a=l 
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where the matrices Q(/3) and R{/3) are 



Q{$) = - \{iB -C)Q Af - \E$ll^ {{B -C)Q A}, (23) 



rii 

Til ' ' ' ' ni 



R0) = - — iCQAf{CQA) + —D + XE, (24) 



with 



A = 


(Z,...,Z), m X (m + l)(L-l), 




B = 


{y{l)l^+l, . . . ,y{L-l)'i-rn+l) ' 




C = 


(77(1)1^+1, ... ,7r(i_l)l^+l) , 




D = 


block diag{Z"^diag(7r(i))Z, . . . , Z^diag 


;(7r(L_i))Z} 


E = 


Uockdiag{K,...,K), (m + l)(L 


— 1) X (m ^ 


Z = 


(•^1 ) • • • ) -^ni ) 5 




Vik) = 






7r(fc) = 


{7rk{xi;$), . . . ,7rfc(a;„,;/3))^. 





Here the operator © denotes the Hadamard product, which means the elementwise prod- 
uct of matrices; that is, Aij Bij = {aijbij) for matrices Aij = (aij) and Bij = ipij). 

4.2 Generalized Bayesian information criterion 

In Bayesian inference, Schwarz (1978) presented the Bayesian information criterion (BIC) 
from the viewpoint of maximizing a marginal likelihood. However, the BIC covers only 
models estimated by the maximum likelihood method. 

By extending the Schwarz's (1978) idea, Konishi et al. (2004) derived a novel Bayesian 
information criterion to evaluate models estimated by regularization in the framework of 
generalized linear models. Hence, by using the result given in Konishi et al. (2004), we 
present a generalized Bayesian information criterion (GBIC) for evaluating the statistical 
model constructed by the semi-supervised functional logistic modeling procedure in the 
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form 

711 L—1 

GBIC = -2 ^ log /3) + n,X ^ $lK^k - {L ~ 1) log | + 

«=1 k=l 

+ log|i?(/§)|-(L-l)(m + l-rf)logA-(L-l)rflog(^^^ , (25) 

where R{/3) is given by Equation and is the product of the positive eigenvalues 
of K with the rank d. 

We thus select a tuning parameter A by minimizing either the model selection criterion 
GIG or GBIG. For more details of derivations about the model selection criteria, we refer 
to Konishi and Kitagawa (2008). 

5 Numerical studies 

We conducted some numerical examples to investigate the effectiveness of the proposed 
modeling procedure. Monte Garlo simulations and a real data analysis are given to illus- 
trate our proposed semi-supervised functional modeling strategy. 

5.1 Monte Carlo simulations 

We demonstrated the efficiency of the proposed functional modeling procedure through 
Monte Garlo simulations. In the simulation study, we generated n discrete samples 
{{xati.ga}', a = 1, . . . ,n, i = 1, . . . J}, where predictors Xau are assumed to be obtained 
by Xati = ha(ti) + Eati and the class label ga indicates 1 or 2 which is the group number. 
We considered two settings as follows: 

Case 1 

2i-2 

K{ti) = Sm{Cati7l)Ua, Eat, ~ ^^(0,0.1), = , Tl = 600, / = 50, 

ga = I : Ca = I, Ma ~ U[0.3, 1.3], 
g^ = 2:ca = 1.02, Ua ~ f/[0. 1,0.6], 
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(a) (b) 

Figure 2: True functions for (a) Case 1 and (b) Case 2. In each case, there are 10 subjects. 
Sohd hues represent the group 1, while dashed hues represent the group 2. 

Case 2 

i + 4 

K{ti) = UaW{ti) + (1 - Ua)v{ti), Eau ~ ^^(0, 1), U = , U = 600, / = 101, 

ga = 1 '■ Ua ^ U[0, 1], w(ti) = max(6 — |tj — 11|, 0), v(ti) = max(6 — |tj — 11|, 0) — 4, 
ga = 2 : Ua ^ U[0, 1], w{ti) = max(6 — |tj — 11|, 0), v(ti) = max(6 — |tj — 11|, 0) + 4. 

Figure [2] denotes the true functions h{t) for the Cases 1 and the Case 2, respectively. 
We divided the data set into 300 training data and 300 test data with an equal prior 
probability for each class. In order to implement the semi-supervised method, the training 
data were randomly divided into two halves with labeled functional data and unlabeled 
functional data, where the labeled functional data were assigned as 5%, 10%, 20%, 30%, 
40%, 50% and 60% of the training data, respectively. 

We compared the performances of semi-supervised functional logistic model (SFLDA) 
with those of supervised functional logistic model (FLDA) proposed by Araki et al. 
(2009b), support vector machine with the RBF kernel (SVM), /c- nearest neighbor classifi- 
cation (KNN), functional support vector machine with the RBF kernel (FSVM) proposed 
by Rossi and Villa (2006), and semi-supervised methods proposed by Zhou et al. (2004) 
(LLGC: learning with local and global consistency) and Yu et al. (2004) (ILLGC: induc- 
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tive learning with local and global consistency). The discrete data set was transformed 
into a functional data set using the smoothing technique described in Section 2. Semi- 
supervised and supervised functional modeling strategies (i.e., SFLDA, FLDA and FSVM) 
were applied into the functional data set. The regularization parameter in the SFLDA 
and the FLDA was selected by using the GIC or the GBIC. For the GIC or the GBIC 
of the FLDA, we refer to Araki et al. (2009a; 2009b). Adjusted parameters included in 
the SVM, the FSVM, the LLGC and the ILLGC were optimized by the five-fold cross 
validation, respectively. The number of neighbors k in the KNN was selected by the 
leave-one-out cross validation. 

Tables [1] and [2] show comparisons of the test error rates for the simulated data. These 
values were averaged over 50 repetitions. The average values of the tuning parameter A 
for 50 runs of the Case 1 were A = 5.96 x 10"^ for the GIG and A = 9.48 x 10"^ for the 
GBIG, while those of the Case 2 were A = 1.00 x 10"^ for the GIC and A = 2.28 x 10"^ 
for the GBIC. For the Case 1, we observe that the SFLDA methods evaluated by the GIC 
and the GBIC are superior to other methods except for the FLDA methods in almost all 
cases. Also, our proposed methods SFLDA seem to provide lower misclassification errors 
than the FLDA methods, when the size of labeled functional data is small (e.g., 10% of 
training data). In the case of the Case 2, the SFLDA methods outperform the SVM, the 
KNN, the FSVM, the LLGC and the ILLGC in all situations with respect to minimizing 
the test errors. In addition, the proposed procedures SFLDA may be competitive or 
slightly superior to the FLDA methods. 

5.2 Microarray data analysis 

We describe an application of the semi-supervised functional discriminant analysis to 
yeast gene expression data given in Spellman et al. (1998). This data set contains 77 
microarrays and consists of two short time-courses (i.e., two time points) and four medium 
time-courses (18, 24, 17 and 14 time points). About 800 genes were classified into five 
different cell-cycle phases, namely, M/Gl, Gl, S, S/G2 and G2/M phases, while the other 
5,378 genes were not classified. For more details of this data set, we refer to Spellman et 
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Table 1: Comparison of test errors with different percentages of labeled functional data in 
the training data set for the Case 1. Figures in parentheses indicate the model selection 
criteria used in the simulation study. 
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al. (1998). 

In our analysis, we used the "cdcl5-based experiment data" sampled over 24 points 
after synchronization. For simplicity, any genes that contain missing values across any 
of the 24 time points were discarded. These expression data were considered to be a 
discretized realization of 632 expression curves evaluated at 24 time points. We function- 
alized the data using the smoothing methodology given in Section 2. A total of 300 genes 
were used as the training data set, and the remaining 332 genes were used as the test data 
set. We compared the SFLDA, which is our proposed semi-supervised functional method, 
with the FLDA, which is the supervised functional method. 

First, we demonstrated the effectiveness of our semi-supervised methodology by set- 
ting functonal data with known class labels as unlabeled functional data. We randomly 
split the training data set into labeled functional data and unlabeled functional data, 
where 15%, 20%, 30%, 40% and 50% of training data are allocated as labeled functional 
data, respectively, and we repeated the procedures 10 times. The values of the selected 
regularization parameter for 10 runs were A = 2.80 x 10~^ for the GIC and A = 7.78 x 10~^ 
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Table 2: Comparison of test errors with different percentages of labeled functional data in 
the training data set for the Case 2. Figures in parentheses indicate the model selection 
criteria used in the simulation study. 
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for the GBIC. Figure [3] shows the average precisions of the test data set for different ratios 
of labeled-unlabeled functional data in the training data set. On the x-axis, 15 means that 
15% of the training data was assigned as labeled functional data, and the remaining 85% 
was used as unlabeled functional data. From the left panel of Figure 3, we observe that 
the SFLDA with the GIC seems to extract useful information from unlabeled functional 
data, since the SFLDA performs better than the FLDA in all cases. In contrast, the 
right panel of Figure 3 shows that the SFLDA is superior to the FLDA until 30% labeled 
functional data, whereas the SFLDA is comparable to the FLDA in the range from 30% 
to 50% labeled functional data. 

Second, we examined the performances of our methods by using real unlabeled func- 
tional data which were not classified by Spellman et al. (1998). We prepared labeled 
functional data which consist of 20%, 25%, 30%, 40%, 50% and 60% of the training data, 
while unlabeled functional data are set to 500 samples randomly selected from 5,378 real 
unlabeled examples. Our proposed models and the supervised functional models were ap- 
plied into the data set. We repeated these procedures 10 times. We obtained the averaged 
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Figure 3: Average prediction errors for several ratios of labeled functional data in the 
training data set. Solid line shows the result of the SFLDA while dashed line shows that 
of the FLDA. The left-hand panel indicates the results for the methods evaluated by the 
GIC, whereas the right-hand panel indicates those by the GBIC. 

optimal values of the regularization parameter for 10 repetitions as A = 1.00 x 10~^ for 
the GIC and A = 7.85 x 10~^ for the GBIC. Figure H] shows the average test error rates for 
various ratios of labeled functional data in the training data set. For the left-hand panel of 
Figure HI the SFLDA outperforms the FLDA without 20% labeled functional data, while 
the SFLDA gives lower prediction errors than the FLDA on 20% labeled functional data. 
Hence, these results suggest that real unlabeled functional data included in Spellman's 
et al. (1998) data set may have a potential for improving a prediction accuracy of our 
functional logistic procedures. 

6 Concluding remarks 

We proposed a semi-supervised functional logistic modeling procedure for the multi-class 
classification problem with the help of regularization. On the step of functionalization, a 
smoothing method using Gaussian basis expansions was applied to the observed discrete 
data set. A crucial issue for our semi-supervised modeling process is the choice of the regu- 
larization parameter A. In order to select the value of the parameter, we introduced model 
selection criteria from the viewpoints of information-theoretic and Bayesian approaches. 
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Figure 4: Average prediction errors for several ratios of labeled functional data in the 
training data set, where we use real unlabeled functional data. Solid line shows the result 
of the SFLDA while dashed line shows that of the FLDA. The left-hand panel indicates 
the results for the methods evaluated by the GIC, whereas the right-hand panel indicates 
those by the GBIC. 

Monte Carlo simulations and a microarray data analysis showed that our modeling strat- 
egy yields relatively lower prediction error rates than previously developed methods. A 
further research should be to construct a semi-supervised functional regression modeling 
or clustering. 
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